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Abstract 

Improvements in data acquisition and processing techniques have lead to an almost continuous 
flow of information for financial data. High resolution tick data are available and can be quite 
conveniently described by a continuous time process. It is therefore natural to ask for possible ex- 
tensions of financial time series models to a functional setup. In this paper we propose a functional 
version of the popular ARCH model. We will establish conditions for the existence of a strictly 
stationary solution, derive weak dependence and moment conditions, show consistency of the es- 
timators and perform a small empirical study demonstrating how our model matches with real data. 
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1 Introduction 

To date not many functional time series models exist to describe sequences of dependent 
£f~) observations. Arguably the most popular is the ARH(l), the autoregressive Hilbertian 

process of order 1. It is a natural extension of the scalar and vector valued AR(1) process 
(cf. Brockwell and Davis [8j). Due to the fact that the ARH(l) model is mathematically 
and statistically quite flexible and well established, it is used in practice for modeling and 
prediction of continuous-time random experiments. We refer to Bosq [U] for a detailed 
T _— l treatment of moving averages, autoregressive and general linear time series sequences. 

Despite the prominent presence in time series analysis it is clear that the applicability 
of moving average and autoregressive processes is limited. To describe nonlinear models 
$—i in the scalar and vector number of different approaches have been introduced 

in the last decades. One of the most popular ones in econometrics is the ARCH model 
of Engle |14| and the more general GARCH model of Bollerslev [5] which have had an 
enormous impact on the modeling of financial data. For surveys on volatility models 
we refer to Silvennoinen and Terasvirta [25J. GARCH-type models are designed for the 
analysis of daily, weekly or more general long-term period returns. Improvements in data 
acquisition and processing techniques have lead to an almost continuous flow of information 
for financial data with online investment decisions. High resolution tick data are available 
and can be quite conveniently described as functions. It is therefore natural to ask for 
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possible extensions of these financial time series models to a functional setup. The idea 
is that instead of a scalar return sequence {y^, 1 < k < T} we have a functional time 
series {?/fc(i), 1 < A; < T, < t < S}, where yk(t) are intraday (log-)returns on day k at 
time t. In other words if {Pf.(t), 1 < k < T, < t < S} is the underlying price process, 
then yk(t) = logPfc(i) — log -Pfc(i — h) for the desired time lag h, where we will typically 
set h = 5min. By rescaling we can always assume that 5=1 and then the interval [0, 1] 
represents one trading day. 

We notice that a daily segmentation of the data is natural and preferable to only one 
continuous time process {y(s), < s < T}, say, for all T days of our sample (cf. Harrison et 
al. [18] . Barndorff- Nielsen and Shephard jl], Zhang et al. [28], Barndorff- Nielsen et al. |3], 
and Jacod et al. |20j). Due to the time laps between trading days (implying e.g. that 
opening and closing prices do not necessarily coincide) one continuous time model might 
not be suitable for a longer period. Intraday volatilities of the euro-dollar rates investigated 
by Cyree et al. [10J empirically can be considered as daily curves. Similarly, Gau [T7] 
studied the shape of the intraday volatility curves of the Taipei FX market. Angelidis and 
Degiannakis [1] compared predictions based on intra-day and inter-day data. Elezovic [13j 
modeled bid and ask prices as continuous functions. The spot exchange rates in Fatum and 
Pedersen |16] can be considered as functional observations as well. Evans and Speight [15] 
uses 5-min returns for Euro-Dollar, Euro-Sterling and Euro- Yen exchange rates. 

In this paper we propose a functional ARCH model. Usually time series are defined 
by stochastic recurrence equations establishing the relationship between past and future 
observations. The question preceding any further analysis is whether such an equation 
has a (stationary) solution. For the scalar ARCH necessary and sufficient conditions have 
been derived by Nelson [23J. Interestingly, these results cannot be transferred directly to 
multivariate extensions (cf. Silvennoinen and Terasvirta [25]). Due to the complicated 
dynamics of multivariate ARCH/GARCH type models (MGARCH), finding the necessary 
and sufficient conditions for the existence of stationary solutions to the defining equa- 
tions is a difficult problem. Also the characterization of the existence of the moments in 
GARCH(p, q) equations is given by very involved formulas (cf. Ling, S. and McAleer [22]). 
It is therefore not surprising that in a functional setup, i.e. when dealing with intrinsi- 
cally infinite dimensional objects, some balancing between generality and mathematical 
feasibility of the model is required. 

In Section [2] we propose a model for which we provide conditions for the existence of a 
unique stationary solution. These conditions are not too far from being optimal. We will 
also study the dependence structure of the model, which is useful in many applications, e.g. 
in estimation which will be treated in Section [3] We also provide an example illustrating 
that the proposed functional ARCH model is able to capture typical characteristics of high 
frequency returns, see Section [4] 

In this paper we use the following notation. Let F denote a generic function space. 
Throughout this consists of real valued functions with domain [0, 1]. In many applications 
F will be equal to %=L 2 ([0, 1]), the Hilbert space of square integrable functions with norm 
\\x\\-}{ = ( Jq x 2 (s)ds) 1 ^ 2 which is generated by the inner product (x,y) = x(s)y(s)ds 
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for x,y E H. Another important example is F = C[0, 1]. This is the space of continuous 
functions on [0, 1] equipped with the sup-norm ||x||oo = su Ptg[o,i] By F + we denote 

the set of non-negative functions in F. To further lighten notation we shall often write x 
when we mean {x(t), t E [0, 1]}, or f3 for integral kernels {/3(t,s), < i < 1, < s < 1} 
as well as for the corresponding operators. If x, y E F then xy stands for pointwise 
multiplications, i.e. xy = {x(s)y(s), s E [0, 1]}. Since integrals will always be taken over 
the unit interval we shall henceforth simply write J x(t)dt. A random function X with 
values in % is said to be in if v v {X) = teHXH^) 1 ^ < oo. 



2 The functional ARCH model 



We start with the following general definition. 

Definition 2.1. Let {£&} be a sequence of independent and identically distributed random 
functions in F. Further let (5 : F + — > F + be a non-negative operator and let 5 E F + . Then 
an F -valued process {yk(s), k E s E [0, 1]} is called a functional ARCH(l) process in F 
if the following holds: 

Vk = £kO-k (2.1) 

and 

°l = 5 + P(&-l). (2.2) 



The assumption for the existence of processes satisfying (2.1 ) and (2.2 ) depends on the 
choice of F. So next we specify F and put some restrictions on the operator f3. Our first 
result gives a sufficient condition for the existence of a strictly stationary solution when 
F = %. We will assume that f3 is a (bounded) kernel operator defined by 

p(x)(t) = J 0(t,s)x(s)ds, x£H. (2.3) 

Boundedness is e.g. guaranteed by finiteness of the Hilbert-Schmidt norm: 

\\P\\% = J J (3 2 (t,s)dsdt <oo. (2.4) 

Theorem 2.1. Let {yk} be the process given in Definition \2. 1\ with F = % and (3 given in 
(2.3), such that the operator (3 is bounded. Define K(ef) = (ff /3 2 (i, s)ef(s)dsdt) . If 



there is some a > such that E{K(£±)} a < 1, then ( |2.1[ ) and ( |2.2[ ) have a unique strictly 
stationary solution in H. Furthermore, o\ is of the form 

a\ = g(e k -i,e k - 2 , ■ ■ ■), (2-5) 
with some measurable functional g : Ti^ — > T~L. 

It follows that {<7fc} and {yk} are not just strictly stationary but also ergodic (cf. 



Stout |26|). Let T k be the u-algebra generated by the sequence {si, i < k}. If (2.1) 
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and (2.2) have a stationary solution and if we assume that Ee k (t) = 0, Ee 2 (t) < oo and 
Eaf,{t) < oo for all t G [0, 1], then due to (|2,5|) it is easy to see that 



corr 



{yk(t),yk(s)\T k -i} = coTv{e k (t),e k (s)}. 



Since by our assumption {e k , k G Z} is stationary, the conditional correlation is indepen- 
dent of k and can be fully described by the covariance kernel C £ (t,s) = Cov(e(t),e(s)). 
However, we have Cov{y k (t), yk(s)\J- k -i} = o~ k (t)o~ k (s)C £ (s,t). This is in accordance with 
the constant conditional correlation (CCC) multivariate GARCH models of Bollerslev |5] 
and Jeantheau 



Our next result shows that a? of (2.5) can be geometrically approximated with m- 



dependent variables, which establishes weak dependence of the processes Q2 - 1 p and (|2.2|). 
Theorem 2.2. Assume that tf 

dent copy of {e k } and define a\ m = g(e k - 1 ,e k -2, ■ ■ ■ , e k 



Theorem 2.2. Assume that the conditions of Theorem 2.1 hold. Let {£ k } be an indepen- 

Then 



mi e k - m -l 



•m-2> 



E{\\ol-alJ\ H }°<<cr* 
with some < r = r(a) < 1 and c = c(a) < oo. 



(2.6) 



To better understand the idea behind our result we remark the following. Assume that 
we redefine 
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9{ £ k-li £ k-2, ■ ■ ■ j £ k- 



™> £ k—m—l,k—m> c fc-m-2,fc-m' ' • 



(*) 



(k) 



where {e^ ,£, i,k G Z} are independent copies of {e£, £ G Z}. In other words, every a'^ gets 

its "individual" copy of {e^} to define the approximations. It can be easily seen that then 
for any fixed m > 1, {& km , k G Z} form m-dependent sequences, while the value on the left 



hand side in inequality (2.6) doesn't change. As we have shown in our recent papers [2j and 
[19J, approximations like (2.6) are particularly useful in studying large sample properties 



of functional data. We use (2.6) to provide conditions for the existence of moments of 



the stationary solutions of (2.1) and (2.2). It also follows immediately from (2.6), that if 



(2.1 ) and (2.2 ) are solved starting with some initial values y$ and o"q, then the effect of the 
initial values dies out exponentially fast. 

In a finite dimensional vector space all norms are equivalent. This is no longer true in 



the functional (infinite dimensional) setup and whether a solution of (2.1) and (2.2) exists 
depends on the choice of space and norm of the state space. Depending on the application, 
it might be more convenient to work in a different space. We give here the analogue of 



Theorems 2.1 and 2.2 for a functional ARCH process in C[0, 1]. 



Theorem 2.3. Let {y k } be the process given in Definition 2.1 with F = C[0, 1] and define 
H{e\) = sup <t<i / f3(t, s)Ei(s)ds. If there is some a > such that E{H (ef)} a < 1, then 



(2.1) and (2.2) have a unique strictly stationary solution in C[0, 1]. Furthermore, a\ can 



be represented as in (2.5). In addition the proposition of Theorem 2.2 holds, with (2.6) 
replaced by 



E{\\a 



a 



km I loo 



} a < cr r 
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We continue with some immediate consequences of our theorems. We start with con- 
ditions for the existence of the moments of the stationary solution of (2.1) and (2.2). 



Proposition 2.1. Assume that the conditions of Theorem 2.1 hold. Then 



If 



and 



then 



^{Ikollw}" < oo 

^{Ikolloo}" < OO, 

E {\\yo\\n} a < oo- 



(2.7) 

(2.8) 
(2.9) 
(2.10) 



Proposition 2.2. Assume that the conditions of Theorem 2.3 hold. Then the analogue of 



Proposition 2.1 holds, with \\ • ||% in (2.7)-(2.10) replaced by 



We would like to point out that it is not assumed that the innovations have finite 
variance. We only need that Sk have some moment of order a > 0, where a > can be 
as small as we wish. Hence our model allows for innovations as well as observations with 
heavy tails. 



According to Propositions 2.1 and 2.2 if the innovation Sq has enough moments, then 
so does <7q and i/q. The next result shows a connection between the moduli of continuity 
of £o and yo. Let 

oj(x, h) = sup sup \x(t + s) — x(t)\ 

0<t<l-h0<s<h 

denote the modulus of continuity of a function x(t). 



Proposition 2.3. We assume that the conditions of Theorem 2.3 are satisfied with a - 
p > 0. If E{ ||eo||oo} P < oo and lim^o E{^(eo, h)} p = 0, then lim^o E^iuo, h)} p = 0. 



According to Theorems 2.2 and 2.3, the stationary solution of (2.1) and (2.2) can be 



approximated with stationary, weakly dependent sequences with values in Ti and in C[0, 1], 
respectively. We provide two further results which establish the weak dependence structure 
of {yk}- 



Proposition 2.4. We assume that the conditions of Theorem 2.1 are satisfied with a 
and 



-E{||eo||oo} P < oo. 



Then 



E{\\yk - ykm\\n} p < n m , -oo < k < oo,m > 1, 
with some < c < oo and < 7 < 1, where yk m = E k°~km- 



(2.11) 
(2.12) 
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It follows from the definitions that the distribution of the y k — y km does not depend 



on k. Hence the expected value in (2.12) does not depend on k. A similar result holds in 
F = C[0, 1] under the sup-norm. 



Proposition 2.5. We assume that the conditions of Theorem 2.3 are satisfied with ot = % 
and that (2.11) holds. Then 

sup E\yk — t/km\ p < C7 m , — oo < k < oo, m > 1, (2-13) 

0<t<l 

with some < c < oo and < 7 < 1, where y km = £kO~km- 



As in case of Proposition 2.4 the expected value in (2.13) does not depend on k. 



3 Estimation 



In this section we propose estimators for the function 5 and the operator (3 in model 



(2.1 )-(2.2 ) which are not known in practice. The procedure is developed for the important 



case where F = Ti and /3 is given as in (2.3). We show that our problem is related to the 



estimation of the autocorrelation operator in the ARH(l) model which has been intensively 
studied in Bosq [6j. However, the theory developed in Bosq [6] is not directly applicable as 
it requires independent innovations in the ARH(l) process, whereas, as we will see below, 
we can only assume weak white noise (in Hilbert space sense). 

We will impose the following 
Assumption 3.1. (a) Ee^(t) = 1 for any t G [0, 1]. 



(b) The assumptions of Theorem 2.2 hold with a = 2. 



Assumption 3.1 (a) is needed to guarantee the identifiability of the model. Part (b) of 



the assumption guarantees the existence of a stationary solution of the model (2.1)-(2.2) 



with moments of order 4. It is necessary to make the moment based estimator proposed 



below working. An immediate consequence of Assumption 3.1 is that (2.4) holds, i.e. /3 is 
a Hilbert Schmidt operator. 

We let 1712 denote the mean function of the y? and introduce 

"k = Vk-<>k = i(4(s) - ljofts), s e [0, 1]}. 



Then by adding on both sides of (2.2) we obtain 

y\ = $ + PhU-i) + »k- 

Since (3 is a linear operator we obtain after subtracting m<i on both sides of the above 
equation 



Vk- m 2 



5-m 2 + /3(m 2 ) + P{y 2 k _i - m 2 ) + v k . 



(3.14) 
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It can be easily seen that under Assumption 3.1 Ev k = (where stands for the zero 
function). Notice also that the expectation commutes with bounded operators, and hence 
that E(j3(yu — ^2)) = P(E(y? — 1112)) = 0. Consequently, taking expectations on both 
sides of (3.14) yields that 



5 - m 2 + f3(m 2 ) = 0. 



Thus, (3.14) can be rewritten in the form 

Z k = fi(Zk-i) + v k with Z k 



y k -m 2 . 



(3.15) 



(3.16) 



Model (3.16) is the autoregressive Hilbertian model of order 1, short ARH(l). For 
estimating the autocorrelation operator /3 we may use the estimator proposed in Bosq [6| 
Chapter 8]. We need to be aware, however, that the theory in [6] has been developed for 
ARH processes with strong white noise innovations, i.e. independent innovations {v k }. In 
our setup the {vk} form only a weak white noise sequence, i.e. for any n 7^ m we have 

E \Wn\\n <00 and E(v n ,x)(v m ,y) = Vx,y£?i, 
and the covariance operator of v n is independent of n. Thus the theory in |6j cannot be 



directly applied. We will study the estimation of (3 in Section 3.1 



Once P is estimated by some f3 say, we obtain an estimator for 5 via equation ( 3.15[ ): 

5 = m 2 -P(rh 2 ), (3.17) 



where we use 



m 2 



1 N 



(3.18) 



k=l 



Let \\f3\\c = suPxewdl/^MIIw : ll x ll — 1} t> e the operator norm of ft. Recall that 
< II/3H.S. The following Lemma shows that consistency of (3 implies consistency of 5. 



3.1 



hold. Let 5 = 5n be given as in (|3.17|). Then 



Lemma 3.1. Let Assumption 

\\6 N -5\\ H = O p (1) x (n-V 2 + \\$ n 

Proof. We have 

\\S N ~ S\\ n < \\m 2 - m 2 \\ H + \\P{m 2 ) - (3(m 2 )\\ n + ||/3(m 2 ) - P(m 2 )\\ n 
< \\m 2 - m 2 \\ H + H/3 - /S||£||m 2 ||^ + ||/3||£||m 2 - m 2 \\ H . 

The result follows once we can show that \\m 2 — m 2 \\u = \\rh 2) N — m '2\\'H = Op(N^ 1 ^ 2 ). 
To this end we notice that by stationarity of {y k } 



E\\rh 2 — m 2 \ 



H 



E 
1 

N 
1 



1 N 



k=l 
N-l 

E 

k=-(N-l) 



H 



\k\ 
N 



E(yl - m 2 ,yl - m 2 ) 



- n I E hl ~ m ^\\u + 2 Z^\ E ^ ~ m2 ' y k ~ m2 )\ 



k=l 
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By construction t/q and the approximation y kk are independent. Repeated application of 



the Cauchy-Schwarz inequality together with Assumption 3.1 (a) yield that 



\E{yl -m 2 ,yl - m 2 )\ = \E (y% - m 2 ,y 2 k - y 2 kk )\ 

/ I, n ||2 \ 1 / 2 / „ n 9 o 1 1 2 \ 1 / 2 

<[E\\y Q -m 2 \\ H J [E\\y k -y kk \\ H j 

1/2 / „ „ „ ll2 \i/2 



(2 \ / / 
E\\a^-m 2 \\ H j [E\\al-a 



2 ||2 
fcfell^ 



Combining these estimates with Theorem 2.2 shows that E \\rh 2 — m 2 \\^ = 0(N l / 2 ). □ 
3.1 Estimation of 

We now turn to the estimation of the autoregressive operator (5 in the ARH(l) model 



(3.16). It is instructive to focus first on the univariate case Z n = fiZ n _\ + u n , in which 
all quantities are scalars. We assume Ev n = which implies EZ n = 0. We also assume 
that |/3 1 < 1, so that there is a stationary solution such that v n is uncorrelated with Z n —\. 
Then, multiplying the AR(1) equation by Z n _\ and taking the expectation, we obtain 
71 = /37o, where 7^ = E[Z n Z n+k ] = cov(Z n , Z n+k ). The autocovariances 7^ are estimated 
in the usual way by the sample autocovariances 

. N-k 
Afk= Z 3 Z 3+k, 

so the usual estimator of j3 is f3 = 71/70- This is the so-called Yule- Walker estimator which 
is optimal in many ways, see Chapter 8 of Brockwell and Davis [8j. 

In the functional setup we will replace condition \/3\ < 1 with \\(3\\s < 1. Notice that 



this condition is guaranteed by Assumption 3.1 and that it will imply the existence of a 



weakly stationary solution of (3.16) of the form 



Z n = ^^On-j), 

where (i 3 is the j-times iteration of the operator f3 and (3° is the identity mapping. The 
estimator for the operator (3 obtained in [6] is formally analogue to the scalar case. We 
need instead of 70 and 71 the covariance operator 

C (-) = E[(Z 1 ,-)Z 1 ] 

and the cross-covariance operator 

C 1 (-) = E[(Z 1 ,-)Z 2 ]. 
One can show by similar arguments as in the scalar case that 

p = c x c- x . 
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To get an explicit form let Ai > A2 > • • • be the eigenvalues of C and let e±, e 2 , ■ ■ ■ be 
the corresponding eigenfunctions, i.e. C(ej) = Ajej. We assume that ej are normalized to 
satisfy ||ej||% = 1. Then {ej} forms an orthonormal basis (ONB) of H and we obtain the 
following spectral decomposition of the operator C: 



C(y) = ^2\ j (e j ,y)e j . 



From (3.19) we get formally that 



and hence 



C-\y) = ^2Xj 1 {e j ,y)e :i , 
i>i 



(3.19) 



(3.20) 



/%) = dCT^y) = E ( Z 1 ,Y / Xj 1 (e j ,y)e j ) Z 2 



i>i / 

= J2*j 1 (ej,y)E((Z 1 ,e j )Z 2 ). 
i>i 

Using Z2 = X^>i(^2> e%)ei we obtain that the corresponding kernel is 

P(t,s) = \J 1 E((Z ll e J )(Z 2 ,e t ))e J (s)e i (t). 
ij>i 



(3.21) 



(3.22) 



If Xj = for all j > p > 1, then the covariance operator is finite rank and we can 



replace (3.19) and (3.20) by finite expansions with the sum going from 1 to p. In this 
case, all our mathematical operations so far are well justified. However, when all A,- > 
then we need to be aware that C _1 is not bounded on Ti. To see this note that Xj — > if 
j — > 00 (this follows from the fact that C is a Hilbert-Schmidt operator). Consequently, 
l|C~ 1 (e ? ')ll'H = Xj 1 — > 00 for j — > 00. It can be easily seen that this operator is bounded 
only on 



D 



i>i 3 



Nevertheless, we can show that the representation ( |3.21 ) holds for all y 6 H by using a 
direct expansion of (3(t,s). Since the eigenfunctions {e/t, k > 1} of C form an ONB of % 
it follows that {e^ 8) e^, k,£ > 1} (e& ® = {ek{s)ee(t), {s,t) S [0, l] 2 }) forms an ONB of 
L 2 ([0, l] 2 ) ='H®'H. This is again a Hilbert space with inner product 



(x,y) 



x(t, s)y(t, s)dtds. 



Note that = \\P\\s < 00 an< ^ hence the kernel function f3 G % <S) H. (Be aware, 

that for the sake of a lighter notation we don't distinguish between kernel and operator 
/3.) Consequently (3(t,s) has the representation 



ki>i 
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As we can write 



Z n +i = ^2 Pk,e(Z n , ek)e£ + v n +i 
k,£>l 



it follows that 



(Z n+ i,ei){Z n , ej) = fik,i{Z n , ek){Z n , £j) + {v n +\, e i){Z n , e-j 



k>l 



and by taking expectations on both sides of the above equation that 
E{Z 2 ,e i )(Z 1 ,e j ) = y^Pk,i{C(e k ),ej) = 

k>l 

Here we used the fact that {v^} is weak white noise. It implies that E(B(vk),x)(v£,y} is 
zero for any bounded operator B and all x, y € % and all k 7^ I. Hence the expansion 
z k = Y^j>o P j ("k-j) provides E(v n+ i,ei)(Z n ,ej) = 0. This shows again ( ]3.22 ). 

We would like to obtain now an estimator for (3 by using a finite sample version of the 
above relations. To this end we set 

1 N 1 N ~ 1 

^(2/) = 77 '%2( Z k,y) z k and d(y) = — ^{Z k ,y)Z k+1 , y£%. 



k=l k=l 

The estimator in [6] and the estimator we also propose here is of the form 



f3(y-K) = ir K C 1 C~i(y-K), 



where 



A 



C-^y;K) = Y J Xj 1 (e J ,y}e 
0=1 



31 



(3.23) 



(Xj,6j) are the eigenvalues (in descending order) and the corresponding eigenfunctions of 
C and px is the orthogonal projection onto the subspace span(ei, . . . , ex)- We notice 
that this estimator is not depending on the sign of the e^'s. The corresponding kernel is 
given as 

N-l K K 

P(t,s;K) = jj—^ £ Y.Y.^f^^^i^M^), ( 3 - 24 ) 

k=l j=l i=l 

and the signs of the ij cancel out. In practice eigenvalues and eigenfunctions of an empirical 
covariance operator can be conveniently computed with the package f da for the statistical 
software R. The estimator (3.24) is the empirical version of the finite expansion 

K K 
i=i j=i 



of ( |3.22 ). 

If the innovations {vk} are i-i-d. Bosq [6j proves under some technical conditions con- 
sistency of the estimator (3.24) when K = K{N): 

\\P-p{K(N))\\ c = op(X) as N ^00. 



10 



The choice of K(N) depends on the decay rate of the eigenvalues, which is not known in 
practice. Empirical results (see Didericksonet al. [12]) show that in the finite sample case 
K = 2, 3, 4 provides best results. The reason why choosing small K is often favorable is 
due to a bias variance trade off. Note that the eigenvalues occur reciprocal in the estimator 
f3 and thus larger K accounts for larger instability if the eigenvalues are close to zero. A 
practical approach is to chose K the largest integer for which \k /\\ > 7, where 7 is some 
threshold. 

Theorem 3.1. Fix some K > 1. Assume that the K + 1 largest eigenvalues of the co- 
variance operator C of satisfy Ai > A2 > • • • > ^k+i > 0. Let f3(K) and $(K) be 
the operators belonging to the kernel functions f3(t,s;K) and /3(t,s;K), respectively. Let 



Assumption 3.1 hold with condition (b) strengthened to a = 4. Then we have 
\\(3{K) - $(K) \\ S = Op (iV- 1 / 2 ) as N 



00. 



In Theorem (3.1) N obviously denotes the sample size which is suppressed in the 
notation. The proof of the theorem is given in Section [5] Our conditions imply that 
E\Z^ < 00. This assumption is probably more stringent than necessary and a relaxation 
would be desirable. Note however, that finite 4th moments are required in [6] even for i.i.d. 

3.2 Simulation study 

In this section we demonstrate the capabilities of our estimators for s) and 5(t) on 
simulated data. We proceed as follows: We will choose a simple f3(t,s) and S(t), simulate 
several days of observations using these parameters, and then use the estimation procedure 



given in Section 3.1 to obtain f3(t, s;2) and 5(t;2) from (3.24) and (3.17) respectively. 



We will use /?(£, s) = 16s(l — s)t(l — t) and 5(t) = 0.01 for our simulations. Now that we 



have chosen s) and S(t) we can simulate data according to (2.1 ) and (2.2 ). We will use 
Ei(t) = Bi(t) + N{ yl — t(l — t) for the error term, where Bi(t) are iid standard Brownian 
bridges and N{ are iid standard normals. Note that this gives E(e 2 (t)) = 1 for all t, which 
is assumed by our estimation procedure. After simulating N days of data we compute 
/3(i, s; 2) and 5(t; 2). Figures |J |ij and Q show the estimates when N = 30, N = 300, and 
N = 3000, respectively. We see from these plots that the estimators described in Section 



3.1 accurately estimate the parameters, /3(t, s) and 5(t), when the sample size is sufficiently 
large. Note that each plot of 5(t; 2) has the true S(t) superimposed. A plot of the true 
f3(t, s) is given in figure [T] 
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Figure 1: 0(t, s) = 16s(l - s)t(l - t) 




0.0 0.2 OA OS 0.8 10 



Figure 2: Using a sample of size iV = 30, we obtain (3(t, s; 2) on the left and 5(t; 2) with 
5(t) = .01 superimposed on the right. 
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Figure 3: Using a sample of size ./V = 300, we obtain /3(t, s; 2) on the left and 5{t; 2) with 
6{t) = .01 superimposed on the right. 
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Figure 4: Using a sample of size N = 3000, we obtain (3(t, s; 2) on the left and S(t; 2) with 
5(t) = .01 superimposed on the right. 
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4 An example 



In this section we show an example illustrating that our model captures the basic features 
of intraday returns. Let Pk(t) denote the price of a stock on day k at time t. Then Uk{t) can 
be viewed as the log-returns of the stock, ?/&(£) = log Pfc(t) — log Pk{t — h), during period 
h (cf. Cyree et al. j 1 0] ) . where h is typically 1, 5, or 15 minutes. We will use h = 5 for 5- 
minute returns. The volatility of the stock is then represented by cr?(t) = Var(yk(t)\J-k-i)- 



The first step to simulating the intraday returns is to estimate the parameters, 6(t) and 
/3(t, s), as outlined in Section 3.1 These parameters were estimated for the S&P 100 index 
based on data from April 1, 1997 to March 30, 2007. The estimated functions, /3(t, s; 2) 
and 6(t; 2), are shown in Figure pi 




Figure 5: Left: j3(t,s;2) estimated from S&P 100 index. Right: S(t;2) estimated from 
S&P 100 index. 

Notice in Figure [i] that P(t,s;2) and 5(t;2) are somewhat larger when t is close to 
or 1. According to ( |2,2[ ) this suggests that the volatility, cr^(t), tends to be larger at the 
beginning and end of each trading day. Higher volatilities at the beginning and the end 
of the trading day have been observed by several authors (cf. Gau [17J and Evans and 
Speight [15J). This phenomenon is consistent with our observed log-return data based on 
the S&P 100 index and is captured by our model. 

Having estimated the parameters, 5(t) and f3(t, s), we can now simulate several days of 
observations according to ((2TTJ) and (Q. We will usee^t) = 2~ 200t v / log(2)W i (2 400 V log(2)) 



for the error term, where Wi(t) are iid standard Brownian motions. Note that this gives 
E(e 2 {t)) = 1 for all t, which is assumed by our estimation procedure. 
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Figure 6: Left panel: Five consecutive days of simulated values for yk{t) ■ Right panel: 
5-minute log-returns for the S&P index between April 11 and April 15, 2000. 

We simulated 5 days of log-returns which we compare with the log-returns of the S&P 
100 index. The right side of Figure [6] is the plot of the 5-minute returns on the S&P 100 
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index between April 11 and April 15, 2000. The left side of Figure [6] shows five consecu- 
tive days of simulated values for yk(t). The simulations show that our model empirically 
captures the main characteristics of financial data. 



5 Proofs 



The proofs of Theorems 2.1 and 2.3 are based on general results for iterated random func- 
tions as those in Wu and Shao |27j and Diaconis and Freedman |llj . For the convenience 
of the reader we shall repeat here the main ideas of |27j . 



Let (S, p) be a complete, separable metric space. Let be another metric space and 
let M : x S — > S be a measurable function. For a random element 9 with values in 
0, an iterated random function system is defined via the random mappings M$(-). More 
precisely it is assumed that 

X n = M dn (X n . 1 ), n G N, (5.25) 

where {9 n , — oo < n < oo} is an i.i.d. sequence with values in 0. Thereby it is assumed 
that Xq is independent of {6 n , n > 1}. For any x G S we define 

S n (x) = M 9n o Me^ o • • • o M 6l (x), n = 1, 2, . . . , 

where o denotes the composition of functions. We also introduce the backward version of 
S n , which is given by 

Z n (x) = M Q _ X o M g _ 2 o • • • o M e _Jx), x G S, n = 1, 2, . . . . 



The following theorem is a slight modification of Theorem 2 of [27J, so that it is immediately 
applicable for our purposes. 

Theorem 5.1. (Wu and Shao, 2004-) Assume that 

(A) there are y$ G S and a > such that E{p(yo, Mg Q (yo))} a < oo and 

(B) there are xq G S, a > 0, < n = ri(a) < 1 and c = c(a) < oo such that 

E{p(S n (x), S n (x ))} a < cr™{p(x,x )} a 

for all x G S and n G N. Then for all x G S we have Z n (x) converges almost surely to 
some which is independent of x. Furthermore Z^ = g((9o,#-i, . . .) and 

E{p{Z n (x),Z 00 )} a < Cl r n , 

where c\ = ci(x,Xo,yo,a) < oo and < r = r{a) < 1. Moreover, the process X n = 



gifini $n-l) • • •) is a stationary solution of (5.25|). Finally, if we let Xq = f(8 , Q,6'_ 1 ,...) 



where {6' n } is an independent copy of{9 n }, then 

E{p(S n (X* ),S n (X ))} a <c 2 r%, 
with some < r 2 = "^(a) < 1 and c 2 = c 2 (a) > 0. 
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Proof of Theorem 2.1 We need to show that the conditions of Theorem 5.1 are satisfied 



when the underlying space is T~L with metric || • ||% and 

Me n (x)(t) = S(t) + I f3(t,s)e 2 n _ 1 (s)x(s)ds. 



To demonstrate (A) of Theorem 5.1 we use yo(t) = 0, < t < 1, and get 

' 2 / cl\ 



|y - M 9 2(y )\\* H = J 5\t)dt < oo, 
by assumption. Since for any x, xq £T~L we have 
\\S n (x) - S n (x )\\ n = \\M d 2(S n -i(x)) - M e 2 (S n -i{xo))\\ H 



2 X l/2 

/3(t,s){S n -i(x)(s) - S n -i(x )(s)}£ n _ 1 (s)ds\ dt 



~ (/{/ /32( *' S)£ '- l(s)(i 7 / {Sn-l(x)(s) - Sn-xixo^sn dsdtj 

= K(e 2 n _ 1 ) ^S n -\{x) - S n -i(x )\\ H , 
by the Cauchy-Schwarz inequality. Repeating the arguments above, we conclude 

n-l 

\\S n (x) - S n (x )\\ n < \\x - x \\ n Y[ K(e 2 ). 
Taking expectations on both sides and using the independence of the e, proves (B). □ 



1/2 



Theorem 2.2 is a simple corollary to Theorem 2.1 and Theorem 5.1 Theorem 2.3 
be proven along the same lines of argumentation and the proof is omitted. 



can 



Proof of Propositions \2A\ and \2.2\ First we establish (|2.7|). We follow the proof of Theo- 
rem 



2.1 



Since E{ ||0q||%} = E{ \ \Z 00 \\u} a , according to the construction in the proof of 



Theorem 12.11 we have 



E{\\a 2 UY = E{\\ Zoo UY 

<E{\\Z 1 (0)\\ n + ||Zi(0) -z^WuY 

< 2 a {E{\\Z 1 (0)\\nY + E{\\Zi{0) - Z^minY}, 



where denotes the "zero function" on [0, 1]. According the proof of Theorem 2.1 and The- 
orem 



5.1 



the term E{ || Zi(0)- Z M (0)|| n } a < oo. Furthermore, the term E{||Zi(0)||^} f 



/ S 2 (t)dtj 2 < oo. To show ( |2.10[ ), we note that 



E {\\yo\\nY = E 



E 



yl{t)dt 



£ 2 (t)a 2 (t)dt 



< J B{||e ||oo} a -E{lko||'H} a , 
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since Eq and <jq are independent processes. Proposition 2.1 is proven. 



The proof of Proposition 2.2 only requires minor modifications and is therefore omitted. 

□ 



Proof of Proposition 2.3. Using recursion 2.1 we have 



u(yo,h)= sup sup \e (t + s)a (t + s) - e (t)a (t)\ 

0<t<l-hO<s<h 

< sup sup {\e {t + s)\\a (t + s) - a (t)\ + \a (t)\\e (t + s) - e (t)\} 

0<t<l-hO<s<h 

< ||eo||oow(cr , h) + ||cro||oow(eo, h). 
The independence of £o and o"o yields 

E{\\a \\ 00 uj(eo,h)Y = E{\\a \\oo} P E{u;(£o,h)} p . 



Proposition 2.2 gives E{ [|cTq ||oo }" < °°- This implies that E{ ||<7o||oo} P < oo and therefore 

lim E{||o- ||oow(eo, h)\ p = 0. 
The identity \yfa — vb\ < y|a — b\, a, b > 0, implies 



oj p (ao,h)= sup sup |o"o(t + s) — o"o(t)| p 

0<t<l-h0<s<h 



< sup sup \a (t + s) — <r (t)| 

\0<Kl-/iO<s</i 



Recursion (2.2) gives 



k 2 (t + S )-a2(t)|<|5(t + s )-^)| + 



{I3(t + s,r)- pfar^yl^dr 



<oj(5,h)+ sup sup sup \(3(t + s, r) — j3{t, r)\ x / y_i(r)dr. 

0<t<l-h 0<s<h 0<r<l 



Hence 



£ (0J p (a , h)) = E{ sup sup \a%(t + s) - crg(t)|} 

0<Kl-/iO<K/i 



< 2 



l{[ sup sup sup \p(t + a,r)-p(t,r)\]' 3 x £{||y ||oo} p + [co(5,h)]H. 

1 0<t<l-h0<s<h0<r<l J 



Proposition 2.2 yields that E{ ||yo||oo} P < oo an d £^{||<7o||oo} P < oo. So by the indepen- 
dence of the processes eo and <7o we conclude 

lim {||eo||oo^(<7o,/i)} P = 0, 



completing the proof of Proposition 2.3 



□ 
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Proof of Theorem 3.1 Under our assumptions it follows from Theorem 2.2 that for any 
m > 1 

E\\Z k - Z fcm ||^ < const x r m 

where r G (0, 1) and Z km are the m-dependent approximations of Z k (constructed by 
using o\ m instead of a\ in the definition of Z k ). This shows that the notion of L 4 -m- 
approximability suggested in Hormann and Kokoszka [19j applies to the sequence {Z k }. As 
consequence we have with q = sign(ej, ej) that 



(a) max_£'||cjej — ej||^ = O (N x ) ; 



(b) max E\\i - XA 2 = O (N 

Ki<K 



-V 



and therefore that 



max Wae-i - ei\\% 

\<i<K 



max 

Ki<K 



Op (iV 1 / 2 ) ; 

Xi - Xi\ = Op (iv- 1 / 2 ) 



(See |191 Theorem 3.2].) The random sign c\ (which we cannot observe) accounts for the 
fact that ej can be only uniquely identified up to its sign. As our estimator /3(K) doesn't 
depend on the signs of the ej, this poses no problem. We define 



and let 



a 



*,3 



N 



1 

— r y^A z k,ei)(z k+1 ,jj 

k=l 



be the empirical counterpart. Then we have 

JV-1 



N 



E 



— r ^ ((Z k ,ei)(Z k+1 ,ej) - E(Z k ,ei)(Z k+1 ,ej)) 
k=l 

1 N ~ L 

v _ 1 ^ ({Z k ,Ciei)(Z k+1 ,Cjej) - (Z k ,ei)(Z k+1 ,ej)) 



=:ri(i,j;iV) + r 2 (i,j;iV). 

The processes Z k = Z k (i,j) = (Z k ,ei)(Z k+ i,ej) are strictly stationary for every choice of 
i and j and we can again define the approximations Z krn in the spirit of Section [2] We 
have by independence of Zq and Z kk 



\Cov(Z ,Z h )\ < EZl + {EZl) l/2 x Y, { E ( Z h ~ 2, 



hh) 



^2^l/2 



h>0 



h>l 



< -^ll^oll^ + EWZqW^ x (E(Z h - Z hh ) 



2\V2 
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Further we have by repeated application of the Cauchy-Schwarz inequality that 

E(Z h - Z hh ) 2 = E [(Z h , ei)(Z h+ x, ej) - {Z hh , ei){Z h+1>h , ej)} 2 

< 2^E[(Z h - Z hh ,ei)(Z h+ i,ej)] 2 + E [(Z hh ,ei)(Z h+1 - Z h+1>h , e,)] 2 } 

< 2 {E{Z h - Z hh , ei ) 2 E(Z h , ej ) 2 + E{Z hh , ei ) 2 E(Z h+1 - Z h+1>h , ej ) 2 } 

< 2E\\Z \\l i {E\\Z h — ZhhWu + E\\Z h+ i - 



< const x r 



h 



for some r £ (0, 1). This proves that the autocovariances of the process {Z^} are absolutely 
summable. A well known result in time series analysis thus implies that 

(N - l)Var (j^rj Z *) ^ 2 E l Cov (^o, Z h )\ < c , VN > 2, 

V fe=l / h>0 

(see e.g. the proof of Theorem 7.1.1. in Brockwell and Davis [H]) where, as we have shown, 
the constant cq is independent of the choice of i and j in the definition of Z\., Hence 
max l<i,j<K N) = Op (JV-Va). 

Using relation (a) above, one can show that also maxi<jj<^ T2(i, j; N) = Op (A^ 1 / 2 ). 
We thus have 



(c) max \<Jij — diCjdijl = Op [N 1 ^ 2 

l<i,j<K ' \ 



We have now the necessary tools to prove Theorem 3.1 By relations (a'), (b') and (c) 
we have that 



P(K) - p[K) 

l<i,j<K K 



a 



e, ® e, — e,- <g> ej 



+ 



J J ® ^ * 1 1 s 



< K 2 1 P (n~^ +O p (1 
The proof follows from \\ej ® ej — c^e^ ® Cjej|| 5 < Hcjej — ej||% + || c^ei — ej||% 



max \\e-j <&> ej 
l<i,j<K 



□ 
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